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The nucleocapsid (N) protein of several members within the order Nidovirales localizes to the nucleolus 
during infection and after transfection of cells with N genes. However, confocal microscopy of N protein 
localization in Vero cells infected with the severe acute respiratory syndrome coronavirus (SARS-CoV) or 
transfected with the SARS-CoV N gene failed to show the presence of N in the nucleoplasm or nucleolus. Amino 
acids 369 to 389, which contain putative nuclear localization signal (NLS) and nucleolar localization signal 
motifs, failed to restore nuclear localization to an NLS-minus mutant Rev protein. These data indicate that 
nuclear localization is not a conserved property among all nidoviruses. 


Severe acute respiratory syndrome (SARS) is caused by an 
enveloped, positive-stranded RNA virus, the SARS coronavi- 
rus (SARS-CoV). Along with the arteriviruses, the coronavi- 
ruses are placed in a single order, Nidovirales (3). Even though 
nidovirus replication is restricted to the cytoplasm, the nucleo- 
capsid proteins (N) of two arteriviruses, porcine reproductive 
and respiratory syndrome virus (PRRSV) and equine arteritis 
virus, have been reported to localize to the nucleolus during 
infection (19, 20, 23, 25). An identical cytoplasmic-nucleolar 
distribution pattern has also been reported for the N proteins 
of several coronaviruses, including representative members of 
group I (transmissible gastroenteritis virus [TGEV]), group IIT 
(mouse hepatitis virus [|MHV]), and group ITI (infectious bron- 
chitis virus [IBV]) viruses (11, 24). These arterivirus and coro- 
navirus N proteins, when expressed alone or fused to the red- 
shifted enhanced green fluorescent protein (EGFP), also 
localize to the nucleolus, demonstrating that translocation 
across the nuclear pore complex and accumulation in the nu- 
cleolus are independent of other viral proteins (11, 20, 23). 
Once inside the nucleolus, the PRRSV and IBV N proteins 
colocalize with major nucleolar proteins, including nucleolin 
and fibrillarin (4, 25). The capacity of N to modulate nucleolar 
function may represent a viral strategy that diverts biosynthetic 
resources from the dividing nucleus to the cytoplasm, the site 
of virus replication (10, 11, 21, 25). There also appears to be a 
mechanism for the export of N from the nucleolus and back to 
the cytoplasm. The N proteins of PRRSV and equine arteritis 
virus accumulate in the nucleus in response to the treatment of 
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cells with lemptomycin B (LMB) (20, 23), an inhibitor of the 
export shuttle protein CRM1 (7). 

The classical scheme for the translocation of a protein from 
the cytoplasm through the nuclear pore complex and into the 
nucleoplasm is initiated through the interaction between a 
nuclear localization signal (NLS) on the cargo protein with the 
NLS-binding site on importin-a (reviewed in references 2 and 
8). Classical NLS sequences are enriched in basic amino acids, 
such as lysines and arginines, and generally conform to one of 
three types, known as monopartite (pat4 and pat7) and bipar- 
tite motifs (9, 17). The pat4 NLS is defined as a continuous 
stretch of four basic amino acids (lysine or arginine) or three 
basic amino acids associated with histidine or proline. The pat7 
NLS starts with proline followed within three residues by a 
segment containing three basic residues out of four. The “bi- 
partite” motif consists of two basic amino acids, a 10-amino- 
acid spacer, and a 5-amino-acid segment containing at least 
three basic residues (reviewed in references 9 and 17). The 
nuclear transport of the PRRSV N protein is dependent on a 
single pat7 NLS, 41-PGKKNKK, which overlaps the RNA- 
binding domain (20, 22, 25). Site-directed scanning mutagen- 
esis identified the requirement of the four lysine residues, 
43-KK and 46-KK, for the transport of N into the nucleus (20). 
A second NLS, 10-KRKK, is located upstream and functions as 
a cryptic NLS which becomes accessible following a conforma- 
tional change in the N protein (20, 21). The nucleolus is not a 
membrane-bound organelle; therefore, the localization of a 
protein to the nucleolus occurs by diffusion through the nucle- 
oplasm and accumulation in the nucleolus via a trans-acting 
nucleolar targeting signal. Viral proteins frequently combine 
an NLS and nucleolar targeting signal into a single localization 
signal sequence, often referred to as a compact nucleolar lo- 
calization signal (NoLS) sequence (14, 15). Compact NoLS 
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38-PKORRPQ (pat7) 


257-KKPROKRTA 
KKPR (pat4) 
PROKRTA (pat7) 


369-PKKDKKKKTDEAQPLPQRQKKQ 
KKKK (pat4) 
PKKDKKKK (pat7) 
PQRQKKQ (pat7) 
KKKKTDEAQPLPQRQKK (bipartite) 
KKKTDEAQPLPQROKKQ (bipartite) 


FIG. 1. NLS motifs in the SARS-CoV N protein. Peptide sequence 
analysis of the SARS-CoV N protein Urbani isolate (GenBank acces- 
sion no. AY278741) was performed using the Web-based program 
PSORT (17). The rules for defining pat4, pat7, and bipartite motifs are 
discussed in the introductory comments of the text. 


sequences are typically no longer than 30 amino acids and 
possess at least 9 basic amino acids, including at least one NLS 
motif (14). 

When analyzed for the presence of NLS-like signal se- 
quences using the computer program PSORT (17), the 422- 
amino-acid SARS-CoV N protein is shown to possess as many 
as eight NLS motifs, which are distributed between amino 
acids 38 to 44, 257 to 265, and 369 to 389 (Fig. 1). Embedded 
within the 369-389 domain are five NLS motifs, including a 
lysine-enriched peptide sequence, 369-PKKDKKK-375, which 
possesses similarities to the pat7 NLS in PRRSV N (20), and 
the well-characterized NLS, 126-PKKKRLYV, found in the sim- 
ian virus 40 large T antigen (13). In addition, the 369-375 
region resembles the putative NLS sequence, 360-PKKEKKL, 
reported for the IBV N protein (24). The 369-389 region 
shares similarities with NoLS domains found in nucleolar pro- 
teins of other viral proteins, including human immunodefi- 
ciency virus Rev and Tat, human T-cell leukemia virus Rex, 
and the capsid protein of Semliki Forest virus (6, 14, 15, 21). 

The localization of SARS-CoV N during infection was stud- 
ied using confocal microscopy of infected cells stained with SA 
46-4, a SARS-CoV N protein-specific monoclonal antibody 
(MAb) prepared in our laboratory against recombinant N pre- 
pared from the SARS-CoV Urbani isolate. Nuclei were iden- 
tified using the nucleic acid stain TO-PRO-3 (Molecular 
Probes) (5). Vero cells (American Type Culture Collection) 
grown on coverslips were mock infected or infected with 
SARS-CoV (Urbani isolate) at a multiplicity of infection of 
0.1. Coverslips were removed at 6, 12, and 24 h after infection, 
washed with phosphate-buffered saline (PBS), fixed for 10 min 
with 2% paraformaldehyde in PBS, and then stained with SA 
46-4 followed by anti-mouse immunoglobulin G antibody con- 
jugated to AlexaFluor 594 (Molecular Probes). All antibodies 
were diluted 1:500 in blocking buffer (PBS containing 3% 
normal goat sera and 0.2% saponin), and incubations were 
performed for 1 h at room temperature followed by extensive 
washing in PBS. Cells were counterstained with TO-PRO-3, 
diluted in blocking buffer, and then viewed on a Zeiss LSM 510 
confocal microscope. In each experiment at least 50 infected 
cells in 10 distinct fields were analyzed. Representative results 
at 24 h after infection are presented in Fig. 2. A group of 
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infected cells, presented in Fig. 2A, shows no evidence for 
anti-N staining in the nucleus or nucleolus. A higher magnifi- 
cation of a representative cell is presented in Fig. 2B, C, and D. 
We found no colocalization between SA 46-4 staining and the 
nuclear stain, TO-PRO-3. The same results were obtained 
from cells fixed and stained with SA 46-4 and TO-PRO-3 at 6 
and 12 h after infection (data not shown). 

Wurm et al. (24) reported the nucleolar localization of coro- 
navirus N proteins in Vero cells transfected with N genes from 
TGEV, MHV, and IBV and stained with anti-N antibodies. 
IBV N tagged with EGFP also localized to the nucleolus of 
Vero cells (11). We performed similar studies of the SARS- 
CoV N gene expressed in Vero cells. A cDNA containing the 
N gene of SARS-CoV Urbani isolate was cloned into pIRES- 
EGFP, pEGFP-N1, and pEGFP-C2 eukaryotic expression vec- 
tors (Clontech). The pIRES-EGFP vector construct, pSARS- 
N-IRES-EGFP, through the incorporation of an internal 
ribosome entry site (IRES) separating the SARS-CoV N and 
EGFP genes, expresses EGFP and N as separate proteins. The 
other vectors express N as an EGFP fusion protein with the N 
protein fused in frame to the amino (SARS-N-EGFP) or car- 
boxyl (EGFP-SARS-N) end of EGFP. The transfection of 
plasmid DNA into Vero cells was performed using Lipo- 
fectamine (Invitrogen) according to the manufacturer’s direc- 
tions. A representative cell at 24 h after transfection with 
pSARS-N-EGFP is shown in Fig. 3A, B, and C. Confocal 
microscopy showed that SARS-N-EGFP fluorescence was re- 
stricted to the cytoplasm with no visual evidence for N in the 
nucleolus. Similar results were obtained for Vero cells trans- 
fected with pSARS-N-IRES-EGFP and stained with the anti-N 
MAb SA 46-4 (data not shown). Hiscox et al. (11) reported 
only a small percentage of cells with IBV N protein in the 
nucleus/nucleolus. In cells transfected with pSARS-N-EGFP 
or pEGFP-SARS-N, we noticed that a small number of cells 
contained at least some EGFP fluorescence in the nucleo- 
plasm, but not in the nucleoli. A single cell exhibiting EGFP 
fluorescence in the nucleus and stained with SA 46-4 is shown 
in Fig. 3D. The intensity of red (SA 46-4) and green (EGFP) 
fluorescence taken along a single axis through the cytoplasm 
and nucleus is presented in the profile below the photomicro- 
graph. The fluorescence intensity profiles show that red fluo- 
rescence is primarily restricted to the cytoplasm (Fig. 3D), 
whereas EGFP fluorescence can be found in both nuclear and 
cytoplasmic compartments but is restricted from the nucleoli. 
The possibility that EGFP fluorescence in the nucleus was the 
result of free EGFP was confirmed by performing a Western 
blot assay of Vero cells transfected with pSARS-N-EGFP or 
pEGFP-SARS-N constructs. Proteins in cell lysates were sep- 
arated by sodium dodecyl sulfate-polyacrylamide gel electro- 
phoresis, transferred to a nylon membrane, and probed with 
anti-GFP antibody. The blot showed the presence of full- 
length N-EGFP and EGFP-N fusion proteins, as well as small- 
er-sized products, including immunoreactive proteins which 
migrated the same as free EGFP (data not shown). Therefore, 
the presence of EGFP in the nucleus of some transfected cells 
is likely the result of the proteolytic cleavage of the SARS-N- 
EGFP fusion protein and release of smaller-sized EGFP-la- 
beled fragments. Collectively, the results from infected and 
transfected cells indicate that SARS-CoV N does not appear to 
target the nucleolus or nucleus. 
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FIG. 2. Distribution of N in SARS-CoV-infected cells. (A) Confocal microscopy of a cluster of infected cells at 24 h after infection. Viral 
antigen was detected using AlexaFluor 594-labeled SA 46-4 anti-N MAb (red). Cells were counterstained with the nuclear stain TO-PRO-3 (blue). 
(B, C, and D) Images of a single representative infected cell at 24 h after infection. (B) AlexaFluor 594-labeled SA 46-4; (C) TO-PRO-3 staining; 
(D) merged image combining SA 46-4 and TO-PRO-3 staining. An 0.8-u.m slice through the nucleus is shown in each image. Images were obtained 


with a 63 oil objective, and panels B, C, and D are magnified x2. 


One possible explanation for the absence of detectable 
amounts of N in the nucleus is that N enters the nucleus but is 
rapidly exported. Therefore, under steady-state conditions 
only a small quantity of N would be visible in the nucleus at any 
one time. In order to test this possibility, we treated psARS- 
N-EGFP-transfected cells with the nuclear export blocker 
LMB, an antibiotic which inhibits CRM1, the major export 
shuttle protein (7, 18). LMB, at concentrations up to 50 nM, 
was added to Vero cells between 4 and 18 h after transfection. 
After addition of LMB, the live cells were then followed under 
a fluorescence microscope for an additional 24 h. Cells trans- 
fected with a recombinant equine infectious anemia virus Rev 
(ERev) cDNA fused to EGFP (pEGFP-ERey) were included 
as a positive control. The continuous shuttling of ERev back 
and forth between the cytoplasm and the nucleus is dependent 
on single NLS and nuclear export signal domains (16, 18). The 
majority of Vero cells transfected with pERev-EGFP showed 
the presence of ERev-EGFP fusion protein in the cytoplasm 
(Fig. 4A). In the presence of 10 nm LMB, the distribution 
pattern shifted towards the accumulation of ERev-EGFP in 
the nucleus (Fig. 4B). Accumulation of ERev-EGFP in the 
nucleus appeared as early as 6 h after the addition of LMB. 
Figure 4C and D show representative results for cells trans- 
fected with pSARS-N-EGFP and treated with LMB. The ad- 


dition of LMB did not alter the cytoplasmic distribution of 
SARS-N-EGFP. The same result was obtained for cells trans- 
fected with psARS-N-IRES-EGFP and stained with SA 46-4 
(data not shown). LMB, when added as early as 4 h after 
transfection and at concentrations as high as 50 nM, did not 
affect the localization properties of SARS-N-EGFP. These 
data further support the notion that the SARS-CoV N protein 
is not translocated into the nucleus. 

Putative NLS sequences in the N proteins of IBV, TGEV, 
and MHV coronaviruses are found at amino acids 340-RKRK, 
391-PKPORKR, and 360-PKKEKKL, respectively (11, 24). 
The corresponding region in SARS-CoV N protein is located 
in a lysine-rich region, between amino acids 369 and 389 (Fig. 
1). Furthermore, the peptide sequence 369-PKKDKKK of 
SARS-CoV N is nearly identical to the well-characterized NLS 
in PRRSV N protein (20) as well as a putative NLS in the N 
protein of IBV (24). To determine if the 369-389 peptide 
possessed nuclear transport activity, we tested the ability of the 
369-389 peptide to substitute for the NLS of ERev. The pat4 
NLS of ERev was removed by deleting six amino acids, 159-K 
RRRKHL, from the C terminus. The construct, pERev(NLS- 
minus)N(369-389)EGFP, was made by ligating the cDNA cor- 
responding to the SARS N 369-389 peptide to the C-terminal 
end of the mutant ERev and followed by EGFP. In Vero cells 
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FIG. 3. Localization of SARS-CoV N-EGFP in Vero cells. (A, B, and C) Confocal microscopic images of a single cell at 24 h after transfection 
with pSARS-N-EGFP. (A) EGFP fluorescence; (B) TO-PRO-3 fluorescence; (C) merged image. (D) Image of a single cell transfected with 
pSARS-N-EGFP and exhibiting EGFP fluorescence in the nucleus. The cell was counterstained with AlexaFluor 594-labeled SA46-4 anti-N MAb 
(red). Below the image is a profile showing the intensity of EGFP (green line) and AlexaFluor 594 (red line) fluorescence along the axis shown 


by the dotted arrow in the image. 


transfected with pERev(NLS-minus)N(369-389)EGFP, the 
chimeric EGFP protein localized to the cytoplasm (Fig. 4E). 
Blocking CRM1 by the addition of 10 nM LMB did not alter 
the cytoplasmic distribution of the chimeric protein (Fig. 4F). 
These data indicate that the 369-389 region of SARS-CoV N 
lacks the ability to substitute for the NLS of a known nucleo- 
cytoplasmic protein. 

Presumably, the translocation of TGEV, MHV, and IBV 
coronavirus N proteins across the nuclear pore complex and 
into the nucleoplasm is dependent on the activities of classical 
NLS domains (11, 24). The presence of eight NLS motifs, 
scattered between three different regions of the SARS-CoV N 


polypeptide, provided a compelling argument for the nuclear 
and nucleolar localization of SARS-CoV N. However, the re- 
sults from this study of N protein localization in Vero cells 
infected with SARS-CoV or transfected with constructs ex- 
pressing the N gene showed no evidence for the localization of 
SARS-CoV N to the nucleoplasm or nucleolus. The absence of 
NLS activity within the lysine-rich 369-389 domain is puzzling. 
We can propose three mechanisms for the absence of NLS 
activity by the 369-389 peptide. First, even though NLS se- 
quences frequently possess negatively charged residues, addi- 
tional negative charges conferred by acidic amino acid residues 
or phosphorylated serine, threonine, or tyrosine residues can 
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FIG. 4. Effect of LMB on localization of SARS-N-EGFP in live cells. (A and B) Cells transfected with the control construct, pEGFP-ERev. 
(C and D) Cells transfected with psARS-N-EGFP. (E and F) Cells transfected with the Rev-SARS chimeric construct pRev(NLS-minus)SARS-N 
(368-389). LMB, at a concentration of 10 nM, was added at 4 h after transfection. The results following LMB treatment are shown in the panels 


on the right. Photomicrographs were taken at 18 h after transfection. 


lower the affinity of the interaction between the NLS and 
NLS-binding site on importin-a (12). An aspartic acid residue 
is found at position 372, within the pat7 NLS motif of the 
SARS CoV-N. Additional, negatively charged amino acids are 
found at positions 378 and 379 (Fig. 1). One more negative 
charge is made available through the potential phosphoryla- 
tion of threonine at position 377 (1). Therefore, four negatively 
charged residues may be sufficient to neutralize both mono- 
partite and bipartite NLS sequences within the 369-389 region. 
A second possibility is that the NLS is in a poor conformation 
to be recognized by importin-«. And finally, NLS recognition 
may be sterically blocked by the interaction between the 369- 
389 region and a cytoplasmic protein. In this study we did not 
evaluate the nuclear localization properties of the NLS motifs 
located in the other regions of the SARS-CoV N protein. Since 
the SARS-CoV N protein does not appear to enter the nu- 
cleus, we can assume that these NLS motifs are inactive or 
inaccessible or that there are additional domains which are 
responsible for retaining N in the cytoplasm. 
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